智能论文笔记

Computer vision based vehicle tracking as a complementary and scalable approach to RFID tagging

Pranav Kant Gaur , Abhilash Bhardwaj , Pritam Shete , Mohini Laghate , Dinesh M Sarode

分类：计算机视觉

2022-09-13

传入/传出车辆的记录是根本原因分析的关键信息，以打击各种敏感组织中的安全违规事件。 RFID标记会阻碍物流和技术方面的车辆跟踪解决方案的可扩展性。例如，要求标记为RFID的每个传入车辆（部门或私人）是严重的限制，并且与RFID一起检测异常车辆运动的视频分析是不平凡的。我们利用公开可用的计算机视觉算法实现，使用有限状态机形式主义开发可解释的车辆跟踪算法。国家机器将用于状态转换的级联对象检测和光学特征识别（OCR）模型中的输入。我们从系统部署站点中评估了75个285辆车的视频片段中提出的方法。我们观察到检测率受速度和车辆类型的影响最大。当车辆运动仅限于在检查点类似于RFID标记的检查点时，将达到最高的检测率。我们进一步分析了700个对Live DATA的车辆跟踪预测，并确定大多数车辆数量预测误差是由于无法辨认的文本，图像布鲁尔，文本遮挡，文本遮挡和vecab外字母引起的。为了进行系统部署和性能增强，我们希望我们正在进行的系统监控能够提供证据，以在安全检查点上建立更高的车辆通知SOP，并将已部署的计算机视觉模型和状态模型的微调驱动为建立拟议的方法作为RFID标记的有希望的替代方法。

translated by 谷歌翻译

Space optimal and asymptotically move optimal Arbitrary Pattern Formation on rectangular grid by asynchronous robot swarm

Avisek Sharma , Satakshi Ghosh , Pritam Goswami , Buddhadeb Sau

分类：机器人

2022-12-06

Arbitrary pattern formation (\textsc{Apf}) is well studied problem in swarm robotics. The problem has been considered in two different settings so far; one is in plane and another is in infinite grid. This work deals the problem in infinite rectangular grid setting. The previous works in literature dealing with \textsc{Apf} problem in infinite grid had a fundamental issue. These deterministic algorithms use a lot space of the grid to solve the problem mainly because of maintaining asymmetry of the configuration or to avoid collision. These solution techniques can not be useful if there is a space constrain in the application field. In this work, we consider luminous robots (with one light that can take two colors) in order to avoid symmetry, but we carefully designed a deterministic algorithm which solves the \textsc{Apf} problem using minimal required space in the grid. The robots are autonomous, identical, anonymous and they operate in Look-Compute-Move cycles under a fully asynchronous scheduler. The \textsc{Apf} algorithm proposed in [WALCOM'2019] by Bose et al. can be modified using luminous robots so that it uses minimal space but that algorithm is not move optimal. The algorithm proposed in this paper not only uses minimal space but also asymptotically move optimal. The algorithm proposed in this work is designed for infinite rectangular grid but it can be easily modified to work in a finite grid as well.

translated by 谷歌翻译

XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning

Pritam Sarkar , Ali Etemad

分类：计算机视觉

2022-11-25

We present XKD, a novel self-supervised framework to learn meaningful representations from unlabelled video clips. XKD is trained with two pseudo tasks. First, masked data reconstruction is performed to learn modality-specific representations. Next, self-supervised cross-modal knowledge distillation is performed between the two modalities through teacher-student setups to learn complementary information. To identify the most effective information to transfer and also to tackle the domain gap between audio and visual modalities which could hinder knowledge transfer, we introduce a domain alignment strategy for effective cross-modal distillation. Lastly, to develop a general-purpose solution capable of handling both audio and visual streams, a modality-agnostic variant of our proposed framework is introduced, which uses the same backbone for both audio and visual modalities. Our proposed cross-modal knowledge distillation improves linear evaluation top-1 accuracy of video action classification by 8.4% on UCF101, 8.1% on HMDB51, 13.8% on Kinetics-Sound, and 14.2% on Kinetics400. Additionally, our modality-agnostic variant shows promising results in developing a general-purpose network capable of handling different data streams. The code is released on the project website.

translated by 谷歌翻译

Replay based for Recovering Autonomous Robotic Vehicles from Sensor Deception Attacks

Pritam Dash , Guanpeng Li , Mehdi Karimibiuki , Karthik Pattabiraman

分类：机器人

2022-09-09

传感器对于机器人车辆（RV）中的自动操作至关重要。对传感器篡改或欺骗等传感器的物理攻击可以通过物理通道为RV提供错误的值，从而导致任务失败。在本文中，我们介绍了DeLorean，这是一个综合诊断和恢复框架，用于保护自动RV免受身体攻击。我们考虑了一种强烈的物理攻击形式，称为传感器欺骗攻击（SDA），其中对手同时靶向不同类型的多个传感器（甚至包括所有传感器）。在SDA下，Delorean检查攻击引起的错误，标识目标传感器，并防止错误的传感器输入在RV的反馈控制环中使用。 Delorean在反馈控制循环中重播历史性状态信息，并从攻击中恢复RV。我们对四个真实和两个模拟的RV的评估表明，DeLorean可以从不同的攻击中恢复RV，并确保在94％的情况下（平均）（平均而言）的任务成功，而不会发生任何崩溃。 Delorean会产生低性能，内存和电池开销。

translated by 谷歌翻译

ActKnow: Active External Knowledge Infusion Learning for Question Answering in Low Data Regime

K. M. Annervaz , Pritam Kumar Nath , Ambedkar Dukkipati

分类：机器学习

2021-12-17

深度学习模型在各种自然语言处理任务中设置了基准。然而，这些模型需要巨大的培训数据，这在许多实际问题中是不可行的。虽然各种技术如域适应，但是几个学习技术解决了这个问题，我们介绍了一种积极地将外部知识的新技术引入学习以解决低数据制度问题。我们提出了一种称为Actknow的技术，它基于知识图（KG）的“按需”在学习中，激发了知识图表（KG）的知识（QA）。通过从概念网络中注入世界知识，我们对基于文本的基于文本的变压器模型的临时挑战 - 在低数据制度中的变压器模型上显示了显着的改进。例如，通过仅使用20％的训练示例，我们分别证明了弧形挑战和OpenBookQA的准确性提高了4％。

translated by 谷歌翻译

Disparities in Dermatology AI: Assessments Using Diverse Clinical Images

Roxana Daneshjou , Kailas Vodrahalli , Weixin Liang , Roberto A Novoa , Melissa Jenkins , Veronica Rotemberg , Justin Ko , Susan M Swetter , Elizabeth E Bailey , Olivier Gevaert

分类：计算机视觉 | 机器学习

2021-11-15

超过30亿人缺乏护理皮肤病。AI诊断工具可能有助于早期皮肤癌检测;然而，大多数模型尚未在不同肤色或罕见疾病的图像上进行评估。为了解决这个问题，我们策划了多样化的皮肤科（DDI）DataSet - 这是一种具有不同皮肤色调的第一个公开的，病理证实的图像。我们展示了最先进的皮肤科AI模型在DDI上表现得很糟糕，ROC-AUC与模型的原始结果相比下降29-40％。我们发现暗肤色和罕见的疾病，在DDI数据集中提供良好，导致性能下降。此外，我们表明，无需多样化培训数据，我们表明最先进的强大培训方法无法纠正这些偏差。我们的研究结果确定了需要解决的皮肤病学AI中的重要弱点和偏见，以确保可靠应用于各种患者和所有疾病。

translated by 谷歌翻译

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity

Pritam Sarkar , Ali Etemad

分类：计算机视觉 | 机器学习

2021-11-09

我们呈现了一个用于学习视听表示的自我监督的框架。在我们的框架中引入了一种小说概念，其中除了学习模态和标准的“同步的”跨模型关系之外，riscross也会学习“异步”的跨模式关系。我们展示通过放松音频和视觉模态之间的时间同步性，网络了解强劲的时间不变的表示。我们的实验表明，音频和视觉方式的强大增强，可放松交叉模态时间同步优化性能。要预先绘制我们提出的框架，我们使用具有不同大小，动力学，动力学-400和augioset的不同数据集。学习的表示是在许多下游任务中评估的，即行动识别，声音分类和检索。 Crisscross显示了动作识别的最先进的性能（UCF101和HMDB51）和声音分类（ESC50）。将公开可用的代码和预赠品模型。

translated by 谷歌翻译

Turning Your Strength against You: Detecting and Mitigating Robust and Universal Adversarial Patch Attacks

Zitao Chen , Pritam Dash , Karthik Pattabiraman

分类：人工智能 | 机器学习

2021-08-11

对图像分类的侵扰贴片攻击攻击图像的深度神经网络（DNN），其在图像的有界区域内注射任意扭曲，可以产生鲁棒（IE在物理世界中的侵犯）和普遍（即，在任何情况下保持对抗的侵犯扰动输入）。这种攻击可能导致现实世界的DNN系统中的严重后果。这项工作提出了jujutsu，一种检测和减轻稳健和普遍的对抗性补丁攻击的技术。对于检测，jujutsu利用攻击“通用属性 - jujutsu首先定位潜在的对抗性补丁区域，然后策略性地将其传送到新图像中的专用区域，以确定它是否真正恶意。对于攻击缓解，jujutsu通过图像修正来利用攻击本地化性质，以在攻击损坏的像素中综合语义内容，并重建“清洁”图像。我们在四个不同的数据集中评估jujutsu（想象成，想象力，celeba和place365），并表明Jujutsu实现了卓越的性能，并且显着优于现有技术。我们发现jujutsu可以进一步防御基本攻击的不同变体，包括1）物理攻击; 2）目标不同课程的攻击; 3）攻击构造不同形状和4）适应攻击的修补程序。

translated by 谷歌翻译